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1. Introduction 1 

5— i , The general coordinate invariance underlying the theory of relativity creates basic problems in 

the analysis of the dynamics of the gravitational field. Usually, specification of the field amplitudes 
and their first time derivatives initially is appropriate to determine the time development of a field 
^ ■ viewed as a dynamical entity. For general relativity, however, the metric field g^ u may be modified 

at any later time simply by carrying out a general coordinate transformation. Such an operation 
does not involve any observable changes in the physics, since it merely corresponds to a relabeling 
under which the theory is invariant. Thus it is necessary that the metric field be separated into 
the parts carrying the true dynamical information and those parts characterizing the coordinate 
system. In this respect, the general theory is analogous to electromagnetic theory. In particular, 
the coordinate invariance plays a role similar to the gauge invariance of the Maxwell field. In 
the latter case, this gauge invariance also produces difficulties in separating out the independent 
dynamical modes, although the linearity here does simplify the analysis. In both cases, the effect 
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of invariance properties (both Lorentz and "gauge" invariance) is to introduce redundant variables 
in the original formulation of the theory to insure that the correct transformation properties are 
maintained. It is this clash with the smaller number of variables needed to describe the dynamics 
(i.e., the number of independent Cauchy data) that creates the difficulties in the analysis. In Lorentz 
covariant field theories, general techniques (Schwinger, 1951, 1953) (valid both in the quantum and 
classical domains) have been developed to enable one to disentangle the dynamical from the gauge 
variables. We will see here that, while general relativity possesses certain unique aspects not found 
in other theories, these same methods may be applied. Two important advantages are obtained by 
proceeding in this fashion. First, the physics of Lorentz covariant field theory is well understood. 
Consequently, techniques which relate back to this area of knowledge will help one to comprehend 
better the physics of general relativity. Second, insofar as quantization of the theory is concerned, 
a formulation closely associated with general quantization techniques which include consistency 
criteria (i.e., the Schwinger action principle) will be more appropriate for this highly non-linear 
theory. Direct correspondence principle quantization (which is suitable for linear theories without 
constraints) may well prove inadequate here. 

A precise determination of the independent dynamical modes of the gravitational field is arrived 
at when the theory has been cast into canonical form and consequently involves the minimal number 
of variables specifying the state of the system. At this level, one will have all the relevant information 
about the field's behavior in familiar form. The canonical formalism, involving only the minimal 
set of variables (which will turn out to be four), is also essential to the quantization program, since 
it yields directly simple Poisson bracket (P.B.) relations among these conjugate, unconstrained, 
variables. Two essential aspects of canonical form are: (1) that the field equations are of first order 
in the time derivatives; and (2) that time has been singled out so that the theory has been recast 
into 3+1 dimensional form. These two features are characteristic of Hamilton (or P.B.) equations 
of motion, in contrast to the Lagrange equations. The first requirement may be achieved in general 
relativity, since its Lagrangian may be written in a form linear in the time derivatives (which is 
called the Palatini form) . The type of variable fulfilling the second requirement is dictated by the 
desire for canonical form, and will be seen also to possess a natural geometrical interpretation. 

The use of the Palatini Lagrangian and of 3+1 dimensional notation does not, of course, impair 
the general covariance of the theory under arbitrary coordinate transformations. In possessing this 
covariance, general relativity is precisely analogous to the parameterized form of mechanics in which 
the Hamiltonian and the time are introduced as a conjugate pair of variables of a new degree of 
freedom. When in parameterized form, a theory is invariant under an arbitrary re-parameterization, 
just as general relativity is invariant under an arbitrary change of coordinates. The action of general 
relativity will thus be seen to be in "already parameterized" form. The well-known relations between 
the usual canonical form and the parameter description will thus provide the key for deriving the 
desired canonical form for the gravitational field. We will therefore begin, in Section 2, with a brief 
review of parameterized particle mechanics. In Section 3, the Lagrangian of general relativity will 
be cast into Palatini and 3+1 dimensional form, and the geometrical significance of the variables 
will be discussed. We will see then that relativity has a form identical to parameterized mechanics. 
Section 4 completes the analysis, to obtain the canonical variables and their relations as well as the 
P.B. equations of motion. 

Once canonical form is reached, the physical interpretation of quantities involved follows di- 
rectly as in other branches of physics. Thus, the canonical variables themselves represent the 
independent excitations of the field (and hence provide the basis for defining gravitational radi- 
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ation in a coordinate- independent way). Further, the numerical value of the Hamiltonian for a 
particular state of the system provides the primary definition of total energy (a definition which 
amounts to comparing the asymptotic form of the spatial part of the metric with that of the ex- 
terior Schwarzschild solution). Similarly, the total momentum is defined from the generator of 
spatial translations. The energy and momentum are invariant under coordinate transformations 
not involving Lorentz rotations at spatial infinity, and behave as a four-vector under the latter. It 
is also possible to set up the analysis of gravitational radiation in a fashion closely analogous to 
electrodynamics by introducing a suitable definition of the wave zone. In this region, gravitational 
waves propagate as free radiation, independent of the strong field interior sources. The waves obey 
ordinary (flat-space) wave equations and consequently satisfy superposition. The Poynting vector 
may also be defined invariantly in the wave zone. In contrast, the Newtonian-like parts of the metric 
cannot be determined within the wave zone; they depend strongly on the interior non-linearities. 
These points are discussed in Section 5. 

When the analysis is extended to include coupling of other systems to the gravitational field 
(Section 6), the above definition of energy may be used to discuss self-energy questions. In this 
way, the static gravitation and electromagnetic self-masses of point particle systems will be treated 
rigorously in Section 7. Here the canonical formalism is essential in order that one may recognize 
a pure particle (no wave) state. The vanishing of the canonical variables guarantees that there are 
no independent field excitations contributing to the energy. The total clothed mass of a classical 
electron turns out to be finite, independent of its bare mass and completely determined by its 
charge. Further, a "neutral" particle (one coupled only to the gravitational field) has a zero 
clothed mass, showing that the mass of a particle arises entirely from its interactions with other 
fields. The physical origin of these finite results is discussed at the beginning of Section 7 in terms 
of equivalence principle considerations. The self-stress T*- 7 of the electron vanishes, showing that 
the particle is stable, its repulsive electrostatic self-forces being precisely cancelled by gravitational 
attraction without any ad hoc compensation being required. Thus, a completely consistent classical 
point charge exists when gravitation is included. These rigorous results are in contrast to the higher 
and higher infinities that would arise in a perturbation analysis of the same problem. 

Whether gravitational effects will maintain the finiteness of self-energies in quantum theory 
(and if so, whether the effective cutoff will be appropriate to produce reasonable values) is at present 
an open question. In the final section (8), some speculative remarks are made on this problem. 
Since a complete set of P.B.'s has been obtained classically in Section 4, it is formally possible 
to quantize by the usual prescription of relating them to quantum commutators. However, the 
non-linear nature of the theory may necessitate a more subtle transition to the quantum domain. 
Section 8 also discusses some of these questions. 

2. Classical Dynamics Background 

2.1. Action principle for Hamilton's equations. As was mentioned in Section 1, general 
relativity is a theory in "already parameterized form." We begin, therefore, with a brief analysis 
of the relevant properties of the parameter formalism (see also Lanczos, 1949). For simplicity, we 
deal with a system of a finite number M of degrees of freedom. Its action may be written as 



where q = dq/dt and the Lagrangian has been expressed in a form linear in the time derivatives. 
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(This will be referred to as the first-order form since independent variation of pi and qi gives rise 
to the first-order equations of motion.) The maximal information obtainable from the action arises 
when not only pi and qi are varied independently, but t is also varied and endpoint variations are 
allowed. Postulating that the total 51 is a function only of the endpoints [SI = G(t\) — Gfa)] 
leads to: (1) the usual Hamilton equations of motion for p, L and qf, (2) conservation of energy 
(dH/dt = 0); and (3) the generating function 



Here 5qi = 5o<7i + qi St where S^qi denotes the independent ("intrinsic") variation of The gen- 
erating function can easily be seen to be the conventional generator of canonical transformations. 
Thus G q = J2iPi^Qi generates changes % — > qi + dqi , pi — > pi while Gt = —H5t generates the 
translation in time. That is, for G q one has [qj,G q ] = Yliilj > Pi]$Qi = ^9j> where [A, B] means 
the Poisson bracket (P.B.), and for Gt one has [qi,Gt] = — [qi,H]5t = —<jiSt by the P.B. form of 
the equations of motion. The above elementary discussion may be inverted to show that, for the 
action of (2.1), if every variable occurring in H is also found in the pq term, then the theory is in 
canonical form and pi and qi obey the conventional P.B. relations. This is the classical equivalent 
of the Schwinger action principle (Schwinger, 1951, 1953). 

2.2. The action in parameterized form. The motion of the system (2.1) is described in terms 
of one independent variable t (the "coordinate"). The action may be cast, as is well known, into 
parameterized form, in which the time is regarded as a function qu+i of an arbitrary parameter r: 



Here, q' = dq/dr, and the constraint equation pm+i + H(p,q) = holds. One may equally well 
replace this constraint by an additional term in the action: 



where N(t) is a Lagrange multiplier. Its variation yields the constraint equation R(pm+i,P, q) = 0, 
which may be any equation with the solution (occurring as a simple root) Pm+i = —H. The 
theory as cast into form (2.3) is now generally covariant with respect to arbitrary coordinate 
transformations f = f(r), bearing in mind that N transforms as dq/dr. The price of achieving 
this general covariance has been not only the introduction of the (M + l)st degree of freedom, but, 
more important, the loss of canonical form, due to the appearance of the Lagrange multiplier N in 
the "Hamiltonian," H' = NR. (N occurs in H' but not in Ylf^i 1 Pil'i-) A further striking feature 
which is due to the general covariance of this formulation is that the "Hamiltonian" H' vanishes by 
virtue of the constraint equation. This is not surprising, since the motion of any particular variable 
F(p,q) with respect to r is arbitrary, i.e., F' may be given any value by suitable recalibration 

T — ► T. 

2.3. Reduction of parameterized action to Hamiltonian form — intrinsic coordinates. As we 
shall see, the Lagrangian of general relativity may be written in precisely the form of (2.3). We 
will, therefore, be faced with the problem of reducing an action of the type (2.3) to canonical form 




(2.2) 





(2.3) 
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(2.1). The general procedure consists essentially in reversing the steps that led to (2.3). If one 
simply inserts the solution, pm+i = —H, of the constraint equation into (2.3), one obtains 



M 



^Piq'i - H(p,q)q' M+1 



1= I dr 

All reference to the arbitrary parameter r disappears when / is rewritten as 

M 



(2.4) 



-/ 



dqM+i 



^Pi{dqi/dq M +i) - H 



(2.5) 



which is identical to (2.1)with the notational change qM+i — > t. Equation (2.5) exhibits the role 
of the variable qM+i as an "intrinsic coordinate." By this is meant the following. The equation 
of motion for qM+i is <Zm+i = N(dR/dqM+i) from (2.3). Also, none of the dynamical equations 
determine N as a function of r. Thus N and hence qM+i, are left arbitrary by the dynamics (though, 
of course, a choice of qM+i as a function of r fixes N). One is therefore free to choose qM+iij) 
to be any desired function and use this function as the new independent variable (parameter): 
Qi = liilM+i), Pi = Pi(qM+i), i = I... M. The action of (2.5), and hence the relations between qi, 
Pi, and qM+i are now independent of r. They are manifestly invariant under the general "coordinate 
transformation" f = f(r) (for the simple reason that r itself no longer appears). The choice of qM+i 
as the independent variable thus yields a manifestly r-invariant formulation and gives an "intrinsic" 
specification of the dynamics. This is in contrast to the original one in which the trajectories of 
qi . . . qM+i are given in terms of some arbitrary variable r (which is extraneous to the system). 

In practice, we shall arrive at the intrinsic form (2.5) from (2.4) in an alternate way. Since 
the relation between gM+i and r is undetermined, we are free to specify it explicitly, i.e., impose a 
"coordinate condition." If, in particular, this relation is chosen to be qM+i = t (a condition which 
also determines N), the action (2.4) then reduces (2.5) with the notational change qu+i — ► t; 
the non-vanishing Hamiltonian only arises as a result of this process. [Of course, other coordinate 
conditions might have been chosen. These would correspond to using a variable other than qu+i 
as the intrinsic coordinate in the previous discussion.] 

This simple analysis has shown that the way to reduce a parameterized action to canonical form 
is to insert the solution of the constraint equations and to impose coordinate conditions. Further, 
the imposition of coordinate conditions is equivalent to the introduction of intrinsic coordinates. 

In field theory it will prove more informative to carry out this analysis in the generator. We 
exhibit here the procedure in the particle case: The generator associated with the action of (2.3) is 

M+l 

G = YjPi 6 ^- NR6t ( 2 - 6 ) 
i=l 

Upon inserting constraints, the generator reduces to 

M 

G = y^ j pjSq i - H8q M +i ■ (2.7) 

i=i 

Imposing the coordinate condition qM+i = t then yields (2.2) From this form, one can immediately 
recognize the M pairs of canonical variables and the non-vanishing Hamiltonian of the theory. 
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One can, of course, perform the above analysis for a parameterized field theory as well. Here 
the coordinates appear as four new field variables q M+IA = x^(r a ), and there are four extra momenta 
PM+n(T a ) conjugate to them. Four constraint equations are required to relate these momenta to the 
Hamiltonian density and the field momentum density, and correspondingly, there are four Lagrange 
multipliers A^(r Q ) for a field. An example in which the scalar meson field is parameterized may 
be found in III. 

3. First-Order Form of the Gravitational Field 

3.1. The Einstein action in first- order (Palatini) form. The usual action integral for general 
relativity 1 

/ = / d 4 xC= [ d A x \f—g R (3.1) 



yields the Einstein field equations when one considers variations in the metric {e.g., g^ v or the 
density q^ u = \f—g g^ u ). These Lagrange equations of motion are then second-order differential 
equations. It is our aim to obtain a canonical form for these equations, that is, to put them in 
the form q = dH/dp, p = —dH/dq. As a preliminary step, we will restate the Lagrangian so that 
the equations of motion have two of the properties of canonical equations: (1) they are first-order 
equations; and (2) they are solved explicitly for the time derivatives. The second property will be 
obtained by a 3 + 1 dimensional breakup of the original four-dimensional quantities, as will be 
discussed below. The first property is insured by using a Lagrangian linear in first derivatives. In 
relativity, this is called the Palatini Lagrangian, and consists in regarding the Christoffel symbols 
a v as independent quantities in the variational principle (see, for example, Schrodinger, 1950). 
Thus, one may rewrite (3.1) as 

Vx£TiV( r ) (3-2) 



where 

= [, iQ Q J , + ^ p ^ a . (3.3) 

Note that these covariant components R^ v of the Ricci tensor do not involve the metric but only 
the affinity T^ a v . Thus, by varying g^ v , one obtains directly the Einstein field equations 

G^v = R» v -\g^R = . (3.4a) 

These equations no longer express the full content of the theory, since the relation between the 
now independent quantities a v and is still required. This is obtained as a field equation by 
varying T^ a u . One then finds 

d^-a = JT ,« + S^Tp v a + Q^Vp » a - Q^T p P a = (3.46) 

which, as is well known, can be solved for a v to give the usual relation Y ^ a „ = = 

The Palatini formulation of general relativity has a direct analog in Maxwell theory, the affinity 
corresponding to the field strength and the metric to the vector potential A^. Here the 
Lagrangian is 

£_= A^F^ + \ F ilv F lxv (3.5) 

1 We use units such that 16-7T7C -4 = 1 = c, where 7 is the Newtonian gravitational constant; electric charge is in 
rationalized units. Latin indices run from 1 to 3, Greek from to 3, and x° — t. Derivatives are denoted by a comma 
or the symbol 9 M . 
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with and to be independently varied. The field equations then become 

F fiV = (3.6a) 

and 

A Vtl t - A„,„ = (3.66) 

which correspond to (3.4a) and (3.46). 

The next step in achieving canonical form is to single out time derivatives by introducing 
three-dimensional notation. For the Maxwell field, we thus define 

E i = F 0i . (3.7a) 

Since the canonical form requires equations of first order in time derivatives (but not in space 
derivatives), we may use the equation Fy = Aj^ — Aij to eliminate Fij. In terms of the abbreviation 

B^\e^{A kd -A^ k ) (3.76) 

the Lagrangian reads 

C = -E l d Ai - \ (B i B i + E i E i ) - A E i , t . (3.8) 

At this stage, the Maxwell equations are obtained by varying C with respect to the independent 
quantities E l , Ai, and AO- 
S' 2. Three-plus- one dimensional decomposition of the Einstein field. The three-dimensional 
quantities appropriate for the Einstein field are (as will be discussed in detail later) 

9ij = 4 9ij , N = (- V )" 1 / 2 , N t = 4 g 0i (3.9a) 



n ij = ^T~g (4 Fp \-g pq 4 Fr s <T)a V . (3.96) 

Here and subsequently we mark every four-dimensional quantity with the prefix 4 , so that all un- 
marked quantities are understood as three-dimensional. In particular, g %3 in (3.9b) is the reciprocal 
matrix to g, L j . The full metric 4 g fiu and 4 g^ may, with (3.9a), be written 

4 9oo = -(N 2 - N t N l ) (3.10) 

where N l = g tJ Nj, and 

4 g 0i = NyN 2 , 4 g 00 = -l/N 2 (3.11a) 

V j = g ij - {N^i/N 2 ) . (3.116) 

One further useful relation is 

\f^g = N^g. (3.12) 

In terms of the basic quantities of (3.9), the Lagrangian of general relativity becomes 



C = \h J 9 4 R = ~9ij d t 7T iJ - NR° - N,R l 

-2{^Nj - ivriV* + iVl^),* (3-13) 

where 

R° = - yfg [ 3 R + g-\\ vr 2 - tt^-)] (3.14a) 
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Ri = -2Ti ij b . (3.146) 

The quantity 3 R is the curvature scalar formed from the spatial metric gij, \ indicates the covariant 
derivative using this metric, and spatial indices are raised and lowered using g %3 and g^. (Similarly, 
7r = 7r%.) As in the electromagnetic example, we have allowed second-order space derivatives to 
appear by eliminating such quantities as Tj k j in terms of g%j k- 

One may verify directly that the first-order Lagrangian (3.13) correctly gives rise to the Einstein 
equations. One obtains 

d tgij = 2Ng- 1 / 2 {TT i:i - \ gij 7r) + N i{j + N j{i (3.15a) 

drfi = - N^fg- (%, - \ 3 R) + \ Ng' 1 ' 2 ^ {ir mn ir mn - ±vr 2 ) 

- 2iVo- 1 / 2 (7r im 7r m i - \^) + Jg (N& - A^ m , m ) (3.156) 

R»(g ij ,TT ij ) = . (3.15c) 

Equation (3.15a), which results from varying Tr tJ , would be viewed as the defining equation for 
7T 1 -' in a second-order formalism. Variation of N and iVj yields equations (3.15c), which are the 
4 G° n = 4 R° At — \ 5^ A R = equations, while equations (3.15b) are linear combinations of these 
equations and the remaining six Einstein equations ( 4 Gjj = 0). 

3.3. Geometrical interpretation of dynamical variables. Before proceeding with the reduction 
to canonical form, it is enlightening to examine, from a geometrical point of view, our specific choices 
(3.9) of three-dimensional variables. Geometrically, their form is governed by the requirement that 
the basic variables be three-covariant under all coordinate transformations which leave the t=const 
surfaces unchanged. Any quantities which have this property can be defined entirely within the 
surface (this is clearly appropriate for the 3+1 dimensional breakup). One fundamental four- 
dimensional object which is clearly also three-dimensional is a curve x^(A) which lies entirely 
within the 3-surface, i.e., x°(A) = const. The vector = dx^/dX tangent to this curve is therefore 
also three-dimensional. The restriction that the curve lie in the surface t=const is then v° = 0, 
and conversely any vector V^, with V° = is tangent to some curve in the surface. Three such 
independent vectors are Vf^ = Sf. Given any covariant tensor A^^ M , its projection onto the 
surface is then . . . V^A^^ M = Ai...j. Thus, the covariant spatial components of any four-tensor 

form a three-tensor which depends only on the surface 2 (in contrast to the contravariant spatial 
components which are scalar products with gradients rather than tangents, and hence depend also 
on the choice of spatial coordinates in the immediate neighborhood of the surface) . This accounts 
for the choice of g^, rather than A g^ . In contrast, N and N{ do not have the desired invariance 
and, in fact, by choosing coordinates such that the x l = const lines are normal to the surface, one 
obtains iVj = 0. (If a; is arranged to measure proper time along these lines, one has also N = 1.) 
By the same argument, one can see that Ai and Fij are appropriate three-dimensional quantities 
in a general relativistic discussion of the Maxwell field. 

The quantity which plays the role of a momentum is more difficult to define within the surface, 
since it refers to motion in time leading out of the original t = const surface. Such a quantity 

2 Of course, any three-dimensional operations (such as contraction of indices with the three-metric) on a three- 
tensor yield quantities defined on the three-surface. 
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is, however, provided by the second fundamental form (see, for example, Eisenhart, 1949), 
which gives the radii of curvature of the t = const surface as measured in the surrounding four- 
space. These "extrinsic curvatures" describe how the normals to the surface converge or diverge, 
and hence determine the geometry of a parallel surface at an infinitesimally later time. Since Ky 
describes a geometrical property of the t = const surface, as imbedded in four-space, it again does 
not depend on the choice of coordinates away from the surface. This may also be seen from a 
standard definition, Ky = —n^.jj, which expresses as the covariant spatial part of the tensor 
n^v) (the four-dimensional covariant derivative of the unit normal, n M = — N5^ to the surface). 3 
For convenience in ultimately reaching canonical form, we have chosen, instead of K^, the closely 
related variable tt 13 = —y/g (K 13 — g l3 K). Thus, the geometrical analysis defines gij and gij as 
suitable quantities, unaffected by the choice of coordinates later in time, while N and iVj describe 
how the coordinate system will be continued off the t = const surface. 

3.4. Initial value problem and dynamical structure of field equations. Returning to the field 
equations (3.15), we may now analyze them from the point of view of the initial value problem. If 
one specifies g^, tt' 13 and N, TVj initially, it is clear that the equations uniquely determine g^ and 
ix 1 ' 3 at a later time, while N and iVj remain undetermined then. Since the latter merely express the 
continuation of the coordinates, the intrinsic (coordinate-independent) geometry of space-time is 
determined uniquely by an initial choice of g^ and ir %3 . This choice is restricted, however, by the 
four constraint equations (3.15c) which relate these twelve variables at the initial time. Subject to 
these compatibility conditions, then, the (g^, tt ij ) constitute a complete set of Cauchy data for the 
theory. 

The maintenance in time of these constraints is guaranteed by the Bianchi identities (Xth°- v = 
0). Hence 

"/u ,0 — ;i — Lv + 1 n • [6. Lb) 

Thus, by virtue of the dynamic equations ^Gij = (and consequently of their spatial derivatives) 
at t = 0, the constraints ^G^ = hold at all times if they hold initially. 

In electrodynamics, the constraint equation corresponding to (3.15c) is obtained by varying Aq 
in (3.8), and is F 0i t i = V • E = 0. The identity ensuring its maintenance in time is just F^^v = 0, 
which may be rewritten as (i^i) o = ~{F' w ,u),i- The right-hand side then vanishes by the dynamic 
equations F' IU )U = 0. 

While the twelve variables (g^, tt 13 ) constitute a complete set of Cauchy data, they do not 
provide a minimal set (which the canonical formalism will eventually give to be two pairs, corre- 
sponding to two degrees of freedom). We may now count the number of minimal variables. Of 
the twelve g^, it' 13 , we may eliminate four by using the constraint equations (3.15c). There will 
correspondingly be four "Bianchi" identities among the twelve equations of motion (3.15a) and 
(3.15b). As we have seen, iV and Ni determine the continuation of the coordinate system without 
affecting the intrinsic geometry (i.e., the physics of the field). For every choice of N and iVj as 
functions of the remaining eight Cauchy data (which represents a choice of coordinate frame) , there 
will result four equations stating that the time derivatives of four of the remaining eight (g, tt) 
variables vanish. [More precisely, a choice of coordinate frame is made by specifying four functions 
(f of (g^, it 13 ) as the coordinates x M . The equations for dtq^ (= Sq) then determine TV and iVj.] 
Thus, after these coordinate conditions are imposed, we are left with four dynamic equations of the 

3 Thus one has Kij — —n^.j) = —n^j) + n^Ft M j — NTi j. 
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form dtu a = f a (u) (a = 1, 2, 3, 4). These equations govern the motion of a system of two degrees of 
freedom. This is to be expected, since the linearized gravitational field is a massless spin two field, 
and the self-interaction of the full theory should not alter such kinematical features as the number 
of degrees of freedom. 

3.5. The gravitational field as an already parameterized system. To conclude this section, we 
point out the characteristic properties of the Einstein Lagrangian of (3.13). We reproduce it here 
with a divergence 4 and a total time derivative 5 discarded: 

C = n ij d t9lj - NR° - N t R l , R" = W (g tj , ir ij ) . (3.17) 

Equation (3.17) is thus precisely in the form of a parameterized theory's Lagrangian as in (2.3). 
This form just expresses the invariance of the theory with respect to transformations of the four 
coordinates and hence the are parameters in exactly the same sense that r was in the particle 
case. That the iV and iV, are truly Lagrange multipliers follows from the fact that they do not 
appear in the pq' (i.e., tt ij dtgij) part of C Their variation yields the four constraint equations R^, 
= 0. The "Hamiltonian" 7i' = NR° + NiR 1 vanishes due to the constraints. The true non- vanishing 
Hamiltonian of the theory will arise only after the constraint variables have been eliminated and 
coordinate conditions chosen. The analysis leading to the canonical form is carried out in the next 
section. 

4. Canonical Form for General Relativity 

4.1. Analysis of generating functions. We are now in a position to cast the general theory 
into canonical form. The geometrical considerations of Section 3 were useful in obtaining the 
Lagrangian in the form (3.17), which, in the light of Section 2, we recognized as the Lagrangian of 
a parameterized field theory corresponding to (2.3). 

The reduction of (3.17) to the canonical form analogous to (2.1) requires an identification of the 
four extra momenta to be eliminated by the constraint equations (3.15c). To this end we consider 
the generator arising from (3.17): 

G = Jd 3 x [tt« S 9ij + T / 6x»] . (4.1) 

The T ^' 5x^ term comes from the independent coordinate variations. However, T ^' vanishes as 
a consequence of the constraint equations. 6 For example, T°q' = —NR° — NiR 1 = 0. When the 
constraints are inserted, G reduces to 

G = J d 3 x n ij S gij (4.2) 

[corresponding to (2.7)] where four of the twelve (gij, it 13 ) are understood to have been expressed 
in terms of the rest by solving R^ 1 = for them. This elimination exhausts the content of the 

4 A divergence in a theory with constraints cannot necessarily be discarded, since such a term may cease to be a 
divergence upon elimination of constraints. In IIIA, it is shown that the divergences neglected in this work are indeed 
to be discarded. 

5 The addition of a total time derivative in a Lagrangian does not, of course, alter the equations of motion and 
corresponds to a canonical transformation in the generator. The properties of canonical transformations in general 
relativity are discussed in IVA. 

6 More precisely TV reduces to an irrelevant divergence. 
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constraint equations. Finally, as in the particle case, coordinate conditions (now four in number) 
must be chosen and this information inserted into (4.2), leaving one now with only four dynamical 
variables (ir A , 4>a). If, in fact, the generator at this stage has the form 



G = I d 3 x 



A=l 



(4.3) 



then the theory is clearly in canonical form with tt a and 4>a as canonical variables and f T ^ 5x^ 
the generator of translations 5x^. In (4.3), % G IJ ['K A , 4>a\ has arisen in the elimination of the extra 
momenta pM+fi, by solving the constraint equations, and the now represent the four variables 
chosen as coordinates qu+^- 

For the Maxwell field, the generator corresponding to (4.2) is, by (3.8), 

G = j d 3 x[-E i 5Ai - ^(E i E i + B i B i )5t + (ExB) • Sr] 

with the constraint E l ^ = understood to have been eliminated. The solution of the con- 
straint equation is, of course, well known in terms of the orthogonal decomposition E l = E lT + 
E lL (V-E T = = VxE L ). The longitudinal component E lL of the electric field vanishes, and 
therefore the canonical form is reached when one inserts this information into G: 

G = j d 3 x[-E iT 5Ai T - ±(E iT E iT + £*£% + (E T xB) • St] . 

Note that Ai L has automatically disappeared from the kinetic term (by orthogonality), and never 
existed in B = VxA = VxA T . Thus, the two canonically conjugate pairs of Maxwell field 
variables are — E %T and Aj '. 

4.2. Analysis of constraint equations in linearized theory-orthogonal decomposition of the 
metric. In order to achieve the form (4.3) for relativity, it is useful to be guided by the linearized 
theory. Here one must treat the constraint equations to second order since our general formalism 
shows that the Hamiltonian arises from them. Through quadratic terms, equations (3.15c) may be 
written in the form 

9ij,ij ~ 9u,jj = ?V [gij , ^ } (4.4a) 

-2^j= V 2 i [9i j y j ] (4.46) 

where V2 and TV are purely quadratic functions of gij and tt^ . These equations determine one 
component of gij and three components of tt zj in terms of the rest. The content of equations (4.4) 
can be seen more easily if one makes the following linear orthogonal decomposition on g^ and tt 13 . 
For any symmetric array fij = fji one has 

fij = fij TT + fij T + (fi,j + fj,i) (4-5) 

where each of the quantities on the right-hand side can be expressed uniquely as a linear functional of 
fij. The quantities fij TT are the two transverse traceless components of fij(fij TT ,j = 0, fu TT = 0). 
The trace of the transverse part of fij, i.e., f T , uniquely defines fij T according to 

fij T = USijf T - (VV 2 )/ T «] (4.6) 
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(and clearly, fij T j = 0, fu T = f T . The operator 1/V 2 is the inverse of the flat space Laplacian, 
with appropriate boundary conditions. The longitudinal parts of fij reside in the remaining part, 
fij + fj t i. Decomposing fi into its transverse and longitudinal (curl-less) parts, one has fi = 
fi T + \f L ,i {fi T ,i = 0). The remainder then becomes fi T :i + fj T :i + f L yij . One may express fi, f T 
and fij TT in terms of fij by 

fi = (1/V 2 )[^, j - \ (1/V 2 )f kjykjt ] (4.7a) 
f = fa ~ (VV 2 )fe (4.76) 

fij = fij ~ fij [fmn] ~ {fi,j[fmn] + fj,i[fmn}} • (4-7c) 

The foregoing orthogonal decomposition for a symmetric tensor is just the extension of the usual 
decomposition of a vector into longitudinal and transverse parts employed in electromagnetic theory. 

Returning to (4.4), one has 

_ V V = V 2 ° (4.8a) 

-2V 2 (7r lT + 7r L 5i ) =7Y • (4.86) 

To first order, one sees that g T and it 1 vanish. 7 These structures begin, therefore, at second order 
where g T = -(l/V 2 )Wi in and -2(7r iT +7r L )i ) = (l/V 2 )T[f n . Here H Un , and are obtained from P° 
and V\ by setting g and ir l equal to zero there. They are just the linearized theory's Hamiltonian 
and field momentum densities. 8 The analysis has thus shown that the constraint equations can 
be solved for g T and 7r* (in terms of the remaining variables) as the four extra momenta. To see 
explicitly that H-im, and generate the appropriate time and space translations, one must return 
to the generator. Inserting the orthogonal decomposition (4.5) for both g^j and TT tJ into (4.2), one 
obtains 

G = Jd 3 x [^ TT 6 9lj TT + 7T ljT 5 9ij T + 2« J + n j ,i)5gij} . (4.9) 

The cross-terms in (4.9) have vanished due to the orthogonality of the decomposition (e.g., f d 3 x 
ir l i TT 5gij = — f d 3 x it 1 ! 7 " 1 \j 5gi = 0). We have also used here the fact that taking the variation of 
a quantity does not alter its transverse or longitudinal character in such a linear breakup and that 
the derivatives commute with the variation. Equation (4.9) may be brought to the desired form by 
further integration by parts and addition of a total variation: 9 

G = J d 3 * {^ TT S gi TT - (-VV)o[-(l/2V 2 )^] + [-2VV r + n L ,i)]}S gi (4.10) 

4.3. Imposition of coordinate conditions. Equation (4.10) is now in the form of (2.7). The 
final step in reduction to canonical form is to impose coordinate conditions. The structure of 
(4.10)suggests that one choose as coordinate conditions 

t = -(l/2V 2 )vr T (4.11a) 

7 We use boundary conditions such that g T and 7r l vanish asymptotically. Note also that 7r lT and tt l are indepen- 
dently determined by (4.8b). 

8 More precisely, Ttun differs from the linearized Hamiltonian density by a divergence which vanishes when coordi- 
nate conditions (4.11) are imposed. 

J Addition of a total variation to a generator corresponds to the addition of a total time derivative in the Lagrangian, 
as stated earlier. 
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x i = 9i . (4.116) 

Alternately, these coordinate conditions can be written in more conventional form 10 by eliminating 
7r T and <7j via (4.7): 

^■-t%- = ( 4 - llc ) 
9ijj = . (4.11d) 

One can see that these coordinate conditions are acceptable by looking at those of the field equations 
that involve dtg% and dt^ The linear part of equation (3.15a) gives, as the equations for the 
longitudinal part of o^, 

dt{9i, 3 + 9j,i) = N hj + N jti . (4.12) 

The Lagrange multipliers TV, = goi are functions determined only when coordinate conditions are 
imposed and must vanish at infinity where space is flat. Inserting (4.11b) into (4.12) gives, consistent 
with the boundary conditions, iVj = everywhere. Similarly, from (3.15b), one has 

^[-(l/2V 2 )vr T ]=iV. (4.13) 

Condition (4.11a) implies N = (— goo) -1 ^ 2 = lj again consistent with the required asymptotic limit. 

Alternately, one can see that equations (4.11) are physically appropriate coordinate conditions 
by a direct comparison with the known results of linearized theory as discussed in I. Thus, as 
mentioned above, H\i n = — V 2 g T and I^L = — 2V 2 ("7r lT + tt l > j) are the linearized theory's Hamilto- 
nian and momentum densities and so their coefficients in the generator (4.10) must be St and 5x % 
respectively, in order that the form (4.3), also obtained in I, be reproduced. 

4.4. Hamiltonian form and independent variables for linearized theory. Since the generator is 

now 

G = J d 3 x \^ TT S gij TT - H Vm (^ TT , mj TT )6t + Xt(7r^ TT , g l3 TT ) (4.14) 

the linearized theory has been put into canonical form, with gij TT and 7r* jTT as the two canonically 
conjugate pairs of variables. 

We will now see the usefulness of the linearized theory in suggesting the choice of canonical 
variables for the full theory. Since the identification is made from the bilinear part of the Lagrangian 
dtgij , which is the same as for the linearized theory, the greater complexity of the full theory, 
i.e., its self-interaction, is to be found only in the non-linearity of the constraint equations. Even 
in the constraint equations, the linearized theory will guide us in choosing g T and tt 1 as the four 
extra momenta to be solved for. 

4.5. Hamiltonian form and independent variables for the full theory. The full theory can now 
easily be put into canonical form. The generator of (4.10) is, of course, also correct for the full 
theory since it comes from the bilinear part of the Lagrangian. The constraint equations (3.15c) 
now read (in a coordinate system to be specified shortly) 

-V 2 / = V°[ 9lJ TT , ^ TT ; g T , tt*; g t , n T ] (4.15a) 
-2Vyr + ^L. )= ^ [ff .,TT ^tt. n i. gh nT] (4 156) 

10 The differential form (4.11c,d) of the coordinate conditions may be integrated to yield (4.11a,b) either by imposing 
appropriate boundary conditions or by a procedure given in Appendix B of III. 



13 



where are non-linear functions of gij and 7r* J . One can again solve these (coupled) equations 
(at least by a perturbation-iteration expansion) for g T and 7r\ Thus, one can again choose - V 2 g T 
and — 2V 2 (ir tT + TV L j i) as the four extra momenta to be eliminated. We denote the solutions of 
equations 

-V 2 / = -S o[^, ^ TT , 9i , n T ] (4.16a) 
-2V 2 (vr^ + 7^,0 = -^[g tJ TT , ^ TT , 9i , n T ] . (4.166) 
These equations are the counterpart of pm+i = —H in the particle case. 

As we have seen, the four constraint equations are maintained in time as a consequence of the 
other field equations. Hence, after inserting equations (4.16) into (3.15a,b), one finds that four of 
these twelve (those for dtg and dtir 1 ) are "Bianchi" identities, leaving eight independent equations 
in the twelve variables gij TT , 7t 1 ^ tt , gi, tt t and TV, and iVj. These equations are linear in the time 
derivatives of the first eight variables. 

We now impose the coordinate conditions (4.11) which determine ir T and gi. The dtg% and 
dtir T equations become determining equations for N and Ni [the full theory's analog of (4.12) and 
(4.13)]. N and iVj are no longer 1 and respectively, but now become specific functionals of gij TT 
and 7r ljTT , which could (in principle) be calculated explicitly. 11 In the last four equations, then, 
TV and iVj may, in principle, be eliminated, leaving a system of four equations involving only gij TT 
and 7r yTT , and linear in their time derivatives. We will now see that this reduced system is in 
Hamiltonian form. 

The generator (4.10) reduces to canonical form [with coordinate, conditions (4.11) imposed 
and constraints (4.16) inserted]: 

G = J d 3 x [w ijTT S gij TT + 1% St + T°i oV] (4.17a) 

while the Lagrangian now becomes 

£ = nUTT dtg ..TT + _ (4m) 

It can be shown that the solutions T ^ of the constraint equations do not depend explicitly on the 
coordinates of (4.11) (see III). This is not unexpected, since the variables gij and 7r u appearing 
on the right-hand side of (4.15) do not depend explicitly on the coordinates in this frame. (Thus, 
only gij = x l j = 5 l j and tt t = — 2V 2 i = appear in g^ and 7r u .) 

4.6. Fundamental Poisson brackets and P.B. equations of motion for general relativity. With 
the generator now in canonical form, we can immediately write down the fundamental equal time 
P.B. relations for gij TT and 7r* jTT . These are 12 

[ ffi / T (x), vr m " TT (x')] = <T%-(x - x') (4.18a) 

b,/ T (x), g mn TT ^)] = = [vr^(x), vr-^(x')] . (4.186) 

The S mn ij(x) in (4.18a) is a conventional Dirac (5-function modified in such a way that the transverse- 
traceless nature of the variables on the left-hand side is not violated. Note that the definition of 
this modified (5-function does not depend on the metric; it is symmetric, transverse, and traceless 
on each pair of indices: 

11 An example of such an explicit determination of N and Ni for coordinate conditions (4.22) is given in (7.7) and 
(7.9). 

12 The variables being used as the coordinates (4.11a,b) of course have vanishing P.B. with all variables. 
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S mn.. = 5 nm.. = gmn.. = gij ^ (4J g a) 

r m ij = = S mn u (4.196) 
8™ a j = . (4.19c) 

From the form of equations (4.17), one also has the P.B. equations of motion: 

d t9ij TT = [ 9ij TT , H) = 5H/5^ TT (4.20a) 

d t ^ TT = [tt^ tt , H) = -5H/5 9i] TT (4.206) 

where H = — J d 3 x T°o is the Hamiltonian. The last equalities in equations (4.20) follow from 
equations (4.18). That (4.18) and (4.20) are consistent with the Lagrangian equations obtained by 
varying (4.17b) is now immediate. 13 Corresponding to the time translation equations (4.20), one 
also has for spatial displacements 

d k g i3 TT = [P k ,gij TT ] = -6P k /5^ TT (4.21a) 

d k ^ TT = [P k y jTT ] = 5P k /5 9ij TT (4.216) 

where P k = + J T ^ d?x is the total momentum operator. With the canonical momentum P^ c = 
J d?x 1° k c = — f d 3 x TT mnTT g m n TT , k , equations (4.21) are obvious. The quantities P k and P k c 
actually coincide, since their respective integrands differ by a divergence of canonical variables, as 
shown in IIIA. 

4.7. Alternate canonical form arising from different coordinate conditions. The canonical 
form of equations (4.17) is, of course, not unique. First one can make the usual type of canonical 
transformations among the canonical variables gij TT and 7r ljTT , leaving the coordinate conditions 
fixed. A more general class is available, however: the generator (4.2) can also be reduced to 
canonical form with coordinate conditions other than (4.11). That the reduction can always be 
carried out is shown in Appendix A of III. An example which will be of use later is given by the 
coordinate conditions 

x l = 9l - (1/4V 2 ) T ; (4.22a) 
t = -(l/2V 2 )(vr T + V 2 7r L ) (4.226) 

or, in differential form, 

gij,j kk 4 g k j, k ji 4 gjj, kk i = o (4.22c) 

tt u = . (4.22d) 

In this frame, the canonical variables are gij TT and tt 1 ^ tt (where the TT symbol now refers to the 
coordinate system (4.22) and hence these canonical variables are different from the set previously 
considered). The T° M in this frame are the solutions of the constraint equations (4.16) for V 2 g T 
and — 2V 2 (tt iT + tt l which will be different functions of the new canonical variables, since again 
T and L are referred to (4.22) rather than (4.11). This frame is of interest since, for gij TT = , the 
metric gij reduces to the isotropic form, i.e., gij = (1 + | g T )5ij. We will discuss, in Section 5, a 
preferred class of physically equivalent frames which includes the two given here. 

13 The primary consistency criterion on the canonical reduction is that equations (4.20) be identical to the Einstein 
equations in the coordinate frame (4.11); this is established in IIIA. 
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It is perhaps worth noting that, with the coordinates of (4.22), the orthogonal decomposition 
of gij takes on the simple form 



9ij = 9ij TT + Sij(l + \g T [g-, 



TT 



7T 



.mnTT 



(4.23) 



Thus, the dynamical aspects of the theory are, as expected, to be found in the deviation of the 
metric from its flat space value. By tracing (4.23), one obtains a local relation giving the canonical 
variable gij TT at a point in this coordinate system in terms of at that point: gij TT = 9ij — \ b~ij9kk- 
The g T term, as we have seen, is directly given by the Hamiltonian density, and, as will be shown 
in the next section, carries the asymptotic (Newtonian) m/r term in the metric. 

7-5. Energy and Radiation 

5.1. Expressions for the energy and momentum P u of the gravitational field. The canonical 
formalism developed in the previous section has brought out the formal features of general relativity 
which have their counterpart in usual Lorentz covariant field theories. As a consequence, the 
physical interpretation of the gravitational field may be carried out in terms of the same quantities 
that characterize other fields, e.g., energy, momentum, radiation flux (Poynting vector). 

The energy E of the gravitational field is just the numerical value of the Hamiltonian for 
a particular solution of the field equations. In obtaining this numerical value, the form of the 
Hamiltonian as a function of the canonical variables is irrelevant, and one may make use of the 
equation, T°o = V 2 g T to express E as a surface integral. One has then 14 



where dSi is the two-dimensional surface element at spatial infinity. Similarly, the total field 
momentum Pi may be written as 



In (5.1) and (5.2) we have assumed, as throughout, that the coordinate system is asymptotically 
rectangular. The basic requirement for an energy to be at all defined is, of course, that space-time 
become flat at spatial infinity (so that the integrals in (5.1,2) are well defined). The rectangularity 
requirement may then be conveniently imposed to avoid unnecessarily complicated expressions. 

5.2. Coordinate invariance of energy-momentum expression; Lorentz four-vector nature of P^. 
The usefulness of formulas (5.1) and (5.2) is enhanced by the fact that, in evaluating the surface 
integrals at infinity, it is not necessary to employ there the original canonical coordinate frame. At 
spatial infinity, where the metric approaches the Lorentz value, coordinate transformations, which 
preserve this boundary condition, must approach the identity transformation (excluding, for the 
moment, Lorentz transformations at infinity). For the transformation x M = x M — ^ then, one has 

14 It should be emphasized that, while the energy and momentum are indeed divergences, the integrands in the 
Hamiltonian and in the space translation generators 1° M [g TT , ir TT ] are not divergences when expressed as functions 
of the canonical variables. 






(5.2) 
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£,^,1; —> (since only dx^/dx" occurs in the transformation laws of and V p a a ). Let us first keep 
only the linear terms in the transformation: 

9ij = 9ij + C,j + ¥ ,i (5-3a) 

^ = ^ -{8^,11 (5-36) 

For the entire formalism, it is necessary that g^ u — rj^ u and T p a a fall off at least as ~ 1/r at spatial 
infinity. Hence, jV , must have a similar behavior. As can be seen from (4.5), these transformations 
affect only gi and tt t leaving g T ij, and the canonical variables invariant to 0(l/r). However, 

in (5.1) and (5.2), one needs to know g 1 ' $ and n 1 j , to 0(l/r 2 ), so that one must also investigate 
the quadratic terms in the transformation law. These present a problem only if a derivative acting 
on ^ „ does not alter its order, i.e., if one has a "coordinate wave" ^ e % ^ x jr whose derivatives 
are again ~ 1/r. A detailed investigation (IVC) shows that expressions (5.1) and (5.2) for P^ are 
invariant under all transformations preserving g^ u — rj^ ~ 1/r, provided one averages the surface 
integrals over the oscillatory terms. The prescription of averaging is justified, since the energy and 
momentum can be defined directly from the leading ~ 1/r terms of g T and ir 1 , in the coordinate 
frame (4.11). [For instance, from (4.16a), one sees that g T = (l/V 2 )T°o ~ E/Airr.\ However, as 
shown in IVC, these 1/r terms are in fact coordinate invariant and the averaged P^ integrals agree 
with them. Thus, by either prescription, our formula for P^ is unaffected by changes of coordinates 
not involving Lorentz transformations at spatial infinity. 

The definition of energy and momentum, as given by the canonical formalism, may be shown 
to coincide with that arising from the stress energy pseudotensor of Landau and Lifshitz. In this 
definition, the energy and momentum can also be given in surface integral form. (Here, too, one 
must average over any oscillatory terms to obtain a coordinate- independent result.) The linear 
terms in this pseudotensor are then just the surface integrals in (5.1) and (5.2), while the higher 
powers are negligible at infinity (using the averaging). This agreement also enables us to note 
directly that P M behaves as a Lorentz vector under rigid Lorentz transformations of the coordinate 
system, since this property is manifest for the pseudotensor expression. That the energy of a system 
should be defined from the asymptotic metric alone follows from the equivalence principle, the total 
gravitational mass being measurable by the acceleration of a test particle at infinity. 

Since P M is constant, due to the fact that T ^ does not depend on x M , explicitly, it can 
be evaluated at any given time. For this purpose, one should need only those initial Cauchy 
data required to specify the state of the system uniquely. Prior to the imposition of coordinate 
conditions, these are gij and it 13 (and not, for example, go^) as we have seen in Section 3. The last 
members of (5.1) and (5.2) are so expressed. When any (asymptotically rectangular) coordinate 
conditions are imposed, one needs only the two pairs of canonical variables of that frame to specify 
the state of the system and, of course, to calculate P M (using the formula P^ = J a^rl ^). 

5.3. Conditions for equality of Hamiltonian and energy — "Heisenberg representation. " As was 
discussed in the previous section, there are an infinite number of coordinate conditions that may 
be invoked to put the theory in canonical form. The question therefore arises as to whether the 
above physical results are affected by the coordinate conditions chosen. While, for example, we 
have seen that the P^ of (5.1) and (5.2) is invariant under change of frame, this does not prove 
that different canonical forms, arising from different frames, will yield the same integral for P^. 
Indeed, not every canonical form of the theory has the same value of the Hamiltonian, as one is 
free to perform Hamilton-Jacobi (H.-J.), or partial H.-J., transformations. One must therefore. 
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take as primary, frames and canonical variables which are in the "Heisenberg representation." 
More precisely, we require that in the expression giving the full metric g„v at any time in terms 
of the canonical variables at that time, there be no explicit coordinate dependence. The preferred 
role is given to the metric in this requirement since basic measurements (rods and clocks) refer 
to g^v directly. Similarlyin standard particle mechanics, the directly measured quantities (which 
appear in the original Lagrangian) are the basic "Heisenberg" variables, and one requires that any 
other "Heisenberg" set be related to them in a time-independent fashion. More generally, even if 
the original Lagrangian (which always defines the Heisenberg representation) is not in canonical 
form, the same requirement is to be imposed. For relativity, the basic variables in the Einstein 
Lagrangian are g^u- (Note that the question of whether a given canonical form is Heisenberg thus 
does not require a comparison with some particular canonical form, and also that each of the two 
forms in Section 4 is indeed Heisenberg.) With this criterion, it is clear that the Hamiltonian 
densities will also have no explicit coordinate dependence, since the canonical equations can be 
obtained by substituting 5^[ff can , 7r can ] into the original field equations. This step produces no 
explicit coordinate dependence in the Heisenberg representation. It can be shown (see IVA) that all 
Heisenberg forms are linked by coordinate transformations of the type = + f^[g c {x), ir c (x)] 
where the / M do not depend on x^ explicitly. Thus the relation between Heisenberg frames is 
determined by the intrinsic geometry and does not contain functions of the coordinates, which are 
independent of the physics of the state. As a result of this form of the transformations, one finds 
that P M is numerically the same in any Heisenberg frame for a given physical situation. Thus (5.1) 
gives correctly the numerical value of the Hamiltonian of any Heisenberg frame and could have 
been derived in any Heisenberg canonical form. 

5.4. "Waves" as excitations of canonical variables. Excitations of the canonical modes of a 
field provide a definition of what one calls waves. This is, of course, the same definition as that given 
in electrodynamics. In electrodynamics, the gauge invariance partly obscures the fact that only the 
transverse parts A T and E T of the vector potential and electric field need be examined to recognize 
waves. These variables, as we saw in Section 4, are just the canonical ones for the Maxwell field. 
Correspondingly, in general relativity we may utilize the non-vanishing of the canonical variables 
as the criterion of the existence of waves. As in electrodynamics, when sources (among which are 
included the non-linear self-interactions) are present, the radiation and induction effects can be 
meaningfully separated only in the "wave zone"; but also as in electrodynamics, the wave concept 
can be employed consistently nearer the sources where it is of heuristic value, although the definition 
is somewhat arbitrary. 

An example of an application of the wave criteria is furnished by the time-symmetric case 
(-/r 1 - 7 = initially). In the frame of (4.22), it was seen that 7Tij TT = if, and only if, the metric 
is isotropic, gij = (1 + \ g T )Sij. One need not explicitly transform to this frame to verify whether 
the canonical variables vanish in it; it is sufficient to examine the three-tensor (see, for example, 
Eisenhart, 1949, Section 28): 

Rijk = Rij\k - Rik\j + \ (9ikR,j - 9ijR,k) (5.4) 

which vanishes if, and only if, gij vanishes in the frame 15 of (4.22). A class of time-symmetric 

15 In terms of the canonical formalism, the time-symmetric situation Tv lj = can be viewed as the requirement 
that excitations in the canonical momentum variable 7r ljTT vanish. This follows from the fact that if -k % ^ tt = 0, 
then 7T l = by the constraint equations (3.15c); consequently, 7r T = in the frame of (4.22). Thus, in this frame, 
7T ljTT = implies TT 13 = 0. 
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waves is furnished by the initial conditions recently discussed by Brill (1959). 



5.5. Radiation and definition of wave-zone. The concept of radiation embodies the idea of 
field excitations escaping to infinity and propagating independently of their sources. In linear field 
theories, the region in which this occurs (the wave zone) is simply one which is many wavelengths 
away from the source. In general relativity, for strong fields, nonlinear effects are important, and 
act as effective sources, so that merely going beyond the matter sources is not, in general, adequate 
to define free radiation. To have a wave zone situation, one must set up further conditions such that 
the non-linear self-interaction terms do not influence the propagation of the waves. Such further 
conditions are also necessary in quantum electrodynamics due to vacuum polarization. The latter 
brings in effective non-linear contributions to the Maxwell action, so that in the presence of an 
arbitrary external electromagnetic field, the self-coupling can produce distortion of the "waves" 
(Delbruck scattering); again, scattering of two photons can occur, even in the absence of external 
fields, thus affecting superposition, that is, free propagation. The usual definitions of radiation 
would thus fail in such situations and, in fact, the "wave zone" is then defined only when the 
self-coupling is negligible. One must therefore require that in the wave zone, the various field 
amplitudes be small when the theory is non-linear. On the other hand, the process of going to the 
asymptotic wave zone region is not an a priori equivalent to taking the linearized approximation. 
In general relativity, we have previously seen that (g T , ir % ) ~ p^/r asymptotically, where P M is 
clearly determined by the interior region. Thus, components of the metric other than the wave 
modes survive asymptotically, and in fact may be comparable in size to the wave modes. 

To define the wave zone, we consider a general situation in which the gravitational canonical 
modes behave as fe lkx /r asymptotically in some region (with wave numbers k up to some maximum 
fcmax); beyond some point (the wave front) they are assumed to vanish very rapidly. (This last 
assumption is needed to make the energy of the waves finite.) The first wave zone requirement is 
the familiar one that kr ;§> 1; this implies that gradients and time derivatives acting on the canonical 
modes are also ~ 1/r. Second, we demand that for all components of the metric, \g^ v — n^l ~ 
a/r <C 1. This criterion can always be met (for radiation escaping to infinity) by taking r large 
enough, and waiting for the wave to reach this distance. This condition implies that, not only is the 
wave disturbance weak (f/r -C 1), but also the Newtonian-like parts of the field (e.g., g T ) are small 
(P^/r < 1). It insures that quadratic (and higher) terms in g^ u — n^ v , are negligible compared 
to the linear ones. Finally, we impose the condition that \dg/d(kx)\ 2 <C \g — r]\. In terms of k m \ n , 
the minimum frequency (or wave number) to be considered (with, of course, k m \ n r 3> 1), one has 
kmin r ^ fcmax(o/r) 1 / 2 . Again, for large enough r, this inequality can always be achieved. [For fixed 
r, it sets a lower bound on frequencies which may be treated in the wave zone.] The purpose of 
this requirement is to make the non-linear structures containing gradients also small compared to 
the leading linear terms. The last two conditions together are arranged to guarantee the absence 
of all self-interactions in the wave zone. 

5.6. General structure of gravitational radiation; superposition, coordinate-independence. 
With the above definition of the wave zone, one finds (see IVB) that the field equations (3.15a,b) 
reduce to 



dtQij 
d t -K ij 




(5.5a) 



(5.56) 



19 



where 0(l/r 2 ) is much less than the leading terms. Using a result of IVB that the orthogonal parts 
of ~ 1/r and ~ 1/r 2 structures are also at most ~ 1/r and ~ 1/r 2 respectively, one finds that 

d m TT = 2^ TT (5.6a) 

dt ^ TT = \V 2 9ij TT (5.66) 

to the order 1/r. Thus the rigorous dynamical modes obey the linearized theory's equations in the 
wave zone. Equations (5.6) are, of course, source-free, so that the radiation propagates indepen- 
dently of its origin, and without any self-interaction or dependence on the Newtonian-like parts 
g T and ir l of the field. The flat-space wave equation with constant coefficients, □ 2 fl a tfl'ij TT = 0, 
represented by (5.6), indicates the absence of curvature effects on the radiation. Further, there 
are no coordinate effects left, since the coordinate-dependent components </j, 7r T , and go^, have 
disappeared. 

We have seen previously that gij TT and tt 1 ^ tt are invariant to the order 1/r under coordinate 
transformations leaving the metric asymptotically flat and not involving Lorentz transformations. 
Thus, in a fixed Lorentz frame, g TT and ir TT represent a coordinate-independent description of the 
radiation. The detailed analysis of IVB shows how any coordinate waves which may be present can 
be identified unambiguously. 

The above derivation may be carried through equally well in the presence of coupling to matter 
(e.g., the Maxwell field), provided of course, the matter system is bounded within the interior 
region (and has finite energy content). Under these circumstances, any gravitational waves in the 
wave zone are still independent of the matter sources. If, however, electromagnetic radiation is 
also propagating in the wave zone with small enough amplitude (and ~ 1/?*), the gravitational 
radiation is still free, being unaffected to ~ 1/r by the electromagnetic waves, since the latter 
couple quadratically (~ 1/r 2 ) in the field equations. Hence, in the common wave zone, the systems 
are correctly independent. 

5.7. Measurables in the wave zone: the Poynting vector, radiation amplitudes. Relation to 
curvature tensor. Since the dynamics of the rigorous canonical modes is identical, in the wave zone, 
to that given by linearized theory, any radiation arising from strong interior fields can be precisely 
simulated by a corresponding solution of the linearized theory with sources. Such an analog metric 
can be made everywhere weak, so that linearized theory is everywhere valid, and one may arrange 
the sources so that in the wave zone, the correct g T and it 1 result. An absorber of gravitational 
radiation in the wave zone clearly cannot distinguish between the true and analog situations. One 
is therefore free to use the Poynting vector of linearized theory to measure the energy flux. From 
the symmetric stress-tensor of I, one finds straightforwardly that 

-Min = 71 { 2 9li,m ~ 9lm,i) ( 5 - 7 ) 

in the coordinate frame of (4.11). However, it is easy to see that the general form of Tj;°, is 
coordinate-invariant through terms of 0(l/r 2 ), as is the right-hand side of (5.7). Hence (5.7) 
provides an invariant formula for the wave-zone Poynting vector. We remark that, as in electro- 
dynamics, the physical part of the Poynting vector is obtained by averaging over oscillatory terms. 
This averaging is of use in proving that XJ;° is coordinate- independent, since it also eliminates 
"coordinate wave" effects. It may also be noted that, in (5.7), no terms involving g T , it 1 have sur- 
vived to 0(l/r 2 ), showing that the Newtonian-like parts do not affect the energy flux of radiation. 
Similarly, in electrodynamics E L does not contribute to ExB since E L ~ 1/r 2 . 
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The canonical variables gij TT and tt 1 ^ tt are, in general, non-local functions of the metric and 
its first derivatives. However, it is possible to obtain their 1/r terms by measurements entirely 
within the wave zone. From the condition kr » 1, it is possible to isolate sufficiently accurately 
the k Fourier component of gij and dtgij, and to obtain gij TT (k), ir^ TT (k) algebraically from 
these. Equivalently, one may measure the fc-component of the curvature tensor Ra^vp to obtain 
these physical quantities since it may be shown that in the wave zone 



g tj TT = (2/k 2 )R z m mj (5.8a) 

irijTT = -{ikek^R, ^ . (5.86) 

Such measurements find a parallel in Maxwell theory, where E T , A T are found from E and B by 

E T = E — k[(k • E)/k 2 ] (5.9a) 

A T = i(kxB)/k 2 . (5.96) 



In fact, electromagnetic wave measurements are commonly of this type. 

It should be noted that just as the definition of electromagnetic radiation is not restricted to 
outgoing radiation (but may include mixtures of outgoing and incoming waves) , this also is true of 
the above definition of gravitational radiation. For the purely outgoing case, the 1/r part of the 
curvature tensor is of type II — Null in Petrov classification (Petrov, 1954, and Pirani, 1957). We 
have seen in this section that the leading (~ 1/r) physical terms in the field past the wave front 
are g T and ir 1 , and that these yield invariantly the energy- momentum vector of the system. In 
the wave zone itself, the dynamical modes specifying the radiation, as well as the Poynting vector, 
could be obtained invariantly. Actually, a good deal of detailed information about the strong fields 
in interior regions is obtainable by purely asymptotic ( "S'-matrix" ) measurements, 16 so that a large 
portion, at least, of the description of a system is manifestly frame-independent, and can be gotten 
without the need for introducing apparatus into strong field regions and hence having to analyze 
its behavior there. 

6. Extension to Coupling with Other Systems 

6.1. Lagrangian for coupled Einstein- Maxwell- charged particle system. Up to now we have 
considered the free gravitational field, sketching in the process the main physical features which 
have emerged there. The extension to coupling enables us to verify that these characteristics remain 
unchanged and provides a basis for the self-energy calculations of the next section. The analysis is 
completely analogous to the discussion of the free field and so we limit ourselves to a brief outline 
for the case of coupling to electrodynamics with point charges (more details may be found in V 
and VA). The addition to the Einstein Lagrangian is 

C M = \ (A^ - A^)r u + \ (~ 4 g) 1/2 r u d a/3 V ^ 
+ J dse(dx^(s)/ds)A fl (x) 5 4 (x-x(s)) 

+ J ds {K^dxP/ds) - \ \'(s)(tt^ u V"' + m 2 )}5 4 (x - x(s)) . (6.1) 

16 J. Plebanski (private communication) has shown how properties of orbits of test particles may be examined by 
purely asymptotic measurements (using geodesic projections) even if the orbits remain entirely within the strong field 
interior region. 
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We are here employing a first-order formalism for the particle as well as for the Maxwell field. 
The Maxwell part of Cm is the straightforward covariant generalization of (3.5), with the field 
strength ^ u now a tensor density. Thus, and ^ u are varied separately, tUS £1X6 X (s), the 
mechanical momentum tt^(s), and the Lagrange multiplier X'(s). The particle Lagrangian is in 
parameterized form (with arbitrary parameter 17 s), as is required for manifest covariance. The 
<5 4 -function is a scalar density, invariantly defined in a metric-independent way by f <5 4 (x)d 4 x = 1; 
the 5 3 -function is similarly defined in three-space, i.e., J 5 3 (r)d 3 r = 1. Introducing the notation 
gi — gOi anc j = 1 e l i k (A k j — Aj y k) (where e* jfc = = 0, ±1 is totally antisymmetric) and using 
the gravitational variables of Section 2, one obtains 

Cm = A, 6 S i + 5 3 (r - r(t))[(^(t) + eA^dx^dt) + vr°] 
+ A [e5 3 (r-r(t))-£\ l ] 

-\ Xtfimnj - N- 2 (tt - N^i) 2 + m 2 ]5 3 (r - r(t)) 

-\ Ng-^g.j [£*£3 + &&] + iV J [e ijk £W k ] . (6.2) 

We have also performed the s integration which led to the replacement of A' by 

A = [\'(s)(ds/dx°(s)] x o( s=t y It is convenient next to the eliminate the non-gravitational constraints 

obtained from varying Aq and A. These are: 

£\ i = e6 3 (r-r(t)) (6.3a) 

7r^7r M + m 2 = (6.4a) 

whose solutions are 

£ iL = -V(e/4vr|r - r(t)|) (6.36) 

7r = ATVi - Nig^TTiiTj + m 2 ) 1/2 . (6.46) 

Here, we have again used the orthogonal breakup of £ 1 in the same flat-space sense as in Section 4. 
Therefore, £ % is to be regarded subsequently as an abbreviation for £ lT + £ tL , with £ lL expressed 
from (6.3b). 

6.2. Canonical reduction of coupled system. At this stage,the matter Langrangian is in the 
reduced form 

Cm = (S tT ) d Ai T + [ Pi dx l /dt]5 3 {v - r(t)) 
-\Ng- 1 l*g ij [E>Si + B i &] 

- N[g^( Pi - eA t T )( Pj - eA?) + m 2 ] 1/2 5 3 (r - r(t)) 

+ N i [e ijk £iB k + ( Pi - eAi T )8 3 (r - r(t))] (6.5) 

where pi = 7Ti + eAi T . The first two terms of Cm are in the standard Spg form and thus (Ai T ', —£ iT ) 
are the canonical variables for the electro-magnetic field, while P i(t), x l (t) are those of the particle. 
Note that the variables Ai, £\ pi or x l , which arose naturally in obtaining canonical form, are 
also the appropriate variables from the geometrical (Cauchy problem) considerations of Section 3. 
Thus, Ai, pi, Hi are covariant spatial components of the corresponding four-vectors, while x l {t) 
locates the particle within the three-surface. The three-vector density character of £ l is established 

17 Equation (6.1) is covariant against any reparameterization s = s(s) with A' transforming as a "vector": A' = 
X'(ds/ds) (just as N does in Section 2). 
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from its relation to the covariant spatial components of the four-dimensional dual "F^ u of the field 
strength: 

ci _ \ ijk *p — 1 Jjk(l r*[iu\ 

The total Lagrangian, which is the sum of Cm and the gravitational part (3.13), now has precisely 
the parameterized form of (2.3), as a consequence of the general covariance of each part. The 
coefficients of N and N l now yield the extended constraint equations 

gR+ l n 2_ 7r i j7r . j = ^3 (r _ r(t))[ ^ (K _ e ^.T )( ^ (66a) 

-2tt^' = 5 3 (r - r(*))(Pi - eA, T ) + e^£ fe . (6.66) 
Equations (6.6) can again be solved for 

V V = 1% 

and 

-2VV T + 7r^) =T° 4 . 

When the coordinate conditions (4.11) are again imposed, the theory is in canonical form with the 
same gravitational canonical variables. One sees most clearly now how the gravitational canonical 
variables characterize independent gravitational field excitations. All the other components of the 
metric, e.g., g T and N, N l depend on the non-gravitational variables as well. In contrast, only 
gij TT and 7r ljTT can be specified initially, irrespective of the excitations of the matter system. 

The quantity — T% now represents the Hamiltonian density of the total system 18 and depends 
only on the canonical variables of all three parts. Further, the expressions (5.1) and (5.2) now give 
the energy-momentum of the total system. The fact that the total energy of the interacting system 
is obtainable from purely metric quantities is analogous to the expression in electrodynamics of the 
total charge in terms of the integral of the longitudinal electric field. It is worth noting that the 
choice of coordinate conditions is unaffected by coupling. Finally, we record the equations of motion 
of the gravitational field including coupling. The extension of the constraint equations (3.15c) has 
been given by (6.6) above; the dtg%j equations (3.15a) are unchanged, since they are of the type 
mi = p (namely, they are defining equations for Christoffel symbols). However, one must now add 
a term \ N%m % ^ to the right-hand side of (3.15b) for dtir^ , where Im^ is the symmetric spatial 
stress-energy tensor density of the matter: 

W = g lj g jk V9 4 l M ik 

= <5 3 (r - r(t))( P l - eA iT )(p j - eA jT )[(p - eA T f + m 2 ]' 1/2 

+ g~ 1,2 \\ g ij (£ m £ m + B rn B m ) - &s> - &&] . (6.7) 

7-7. Static Self-Energies 

7.1. Physical basis for finiteness of self-energy when gravitation is included. The canonical 
formalism developed for coupled systems in the previous section allows one to define pure particle 
states by the vanishing of the canonical variables referring to field excitations. For particles at rest, 
therefore, the energies of such states are just the total rest and interaction energies, which include, 
of course, their self-energies. 

1 In the flat space limit, this Hamiltonian density is correctly the usual one for electrodynamics. 
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The aim of classical point electron theory, since its inception, has been to obtain a finite, 
model-independent electromagnetic self-energy, and if possible, to dispense with mechanical mass 
altogether. Thus the total mass of the particle would arise from its coupling to the field. Such a 
program, however, was not feasible, since the self-energy diverged linearly, with no realistic com- 
pensation possible. In terms of renormalization theory, this implied an infinite "bare" mechanical 
mass. Since gravitational interaction energy is negative on the Newtonian level, it may be expected 
to provide compensation. Indeed a simple argument yields a limit on the self-energy due to just 
such a compensation. Consider a bare mass m$ distributed in a sphere of radius e. In the New- 
tonian limit, the total energy (i.e., the clothed mass) m is given by m = mo — ^jmo 2 /e. For 
sufficiently small e, m could become zero and then negative. In general relativity, the principle of 
equivalence states that it is the total energy that interacts gravitationally and not just the bare 
mass. Thus, as the interaction energy grows more negative, were a point reached where the to- 
tal energy vanished, there could be no further interaction energy. Consequently, there can be no 
negative total energy, in contrast to the negative infinite self-energy of Newtonian theory. General 
relativity effectively replaces mo by m in the interaction term: m = mo — \ ^ym 2 /e. Solving for m 
yields m = 7~ 1 [— e + e 2 + 2^/moe) 1 ^ 2 ], which shows that m — > as e — ► 0. 

More interesting is the fact that the gravitational interactions produce a natural cutoff for 
the Coulomb self-energy of a point charge. Here the self-mass resides in the Coulomb field 
7j f(e/4-irr 2 ) 2 d 3 r. By the general argument above on gravitational compensation, one expects 
that the Coulomb energy near the origin (which is, in fact, "denser" than the neutral particle's 5- 
function distribution) will have a very strong gravitational self-interaction, resulting in a vanishing 
total contribution to the self-mass from the region near the origin. Thus, the integral effectively 
extends down only to some radius a, yielding rriEM = \ J a °° (e/4irr 2 ) 2 d 3 r = (e 2 /47r)/2a. We can 
determine this effective flat-space cutoff a by the same equivalence principle argument. Without 
the gravitational contribution, the mass is mo + ^(e 2 /4-7re), so the total clothed mass is determined 
by the equation m = mo + \ e 2 /Aire — \ jm 2 /e. This yields 

m = 7 _1 {-e + [e 2 + 2m e 7 + [e 2 /^] 1 / 2 } . (7.1) 

We will see below that this formula is a rigorous consequence of the field equations. In the limit 
e — > , we have m = (e 2 /47r) 1 / 2 7 -1 / 2 . The bare mechanical mass m again does not contribute to 
the clothed mass. Our result is then equivalent to a cutoff a = \ (e 2 /47r) 1 / 2 7 1 / 2 on the flat-space 
Coulomb integral. 

7.2. Calculation of charged and neutral particle self-energies from initial data. We begin the 
self-energy calculation, then, by obtaining a solution of the field equations for a pure one-particle 
state, i.e., a state containing no independent excitations of the gravitational or electromagnetic 
fields in the rest frame. This requires that gij TT = tt^ tt = A lT = £ lT = pi = on the t = 
const surface where the energy is being computed. With the coordinate conditions of (4.22), we 
are therefore dealing with the time-symmetric situation ix %3 = 0. According to (4.23), then, the 
metric is isotropic; it is convenient to write it as gij = x 4 ( r ) From Section 7-5, the energy is 
the coefficient of l/(327rr) in the asymptotic form of x{ r )- The field equation determining g T and 
hence x 1S (6.6a). One has 19 

V9 3 R = -8 X V 2 X = m 5 3 (r) + \ X ~ 2 £ lL £ iL (7.2a) 

19 Professor L.N. Cooper has pointed out to us that (7.2a) with e = [— 8xV 2 x = Po(r), where po is any bare 
mass distribution] can be obtained by simple equivalence principle arguments starting from Newtonian theory. The 
Poisson equation V <f> = 47T7po = \ po for the gravitational potential (f>, must be corrected to include the particle's 
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and the electric field E L by (6.3b) is 

£ lL = (-e/4*r),i 



(7.3) 



A formal solution of (7.2a) may be found by setting \ 2 = 4 }2 — 4> 2 (Misner and Wheeler, 1957). 
One finds 

-8 X V 2 X = S{(jN 2 (j) - V'V 2 ^) + 8(V- 2 - 2 )~ 1 (^V^ - i)V(j)) 2 

= m <5 3 (r) + \ - 2 )- 1 (E L ) 2 . (7.26) 
If one then makes the assumption 

E L = 4(0VV> - ^V0) (7.4) 

with 4> = e/16nr and (1 +m/32-7rr) = -ip, (7.4) correctly reproduces (7.3). Equation (7.2b) then 
determines the total mass m to be the e — > limit of (7.1): 

m = lim 16vr{-e + [e 2 + (e/87r) 2 + m e/87r] 1/2 } . (7.5) 

e— >0 

In (7.5) the parameter e has been introduced by setting 5 3 (r)/r equal to <5 3 (r)/e, and is thus 
essentially the "radius" of the J-function. [This interpretation of the 5-function corresponds to 
viewing <5 3 (r) as the limit of a shell distribution 5(r — e)/4-7rr 2 of radius e. In V, it is shown that 
the results of this section are, in fact, independent of the model chosen for 5 3 (r) in the point 
limit.] The result (7.5) is that m = 2|e| and hence the total mass is finite and independent of the 
bare mechanical mass. The gravitationally renormalized electrostatic self-energy is now finite. The 
analysis also points out that mass only arises if a particle has nongravitational interaction with a 
field of finite range. 20 For example, an electrically neutral particle coupled to a Yukawa field would 
acquire a mass by virtue of this coupling. 

A solution may also be obtained for the case of two particles of like charge. The results are 
consistent with those obtained above for one particle (see V). The energy is just 2(|ei| + | ©2 1 ) , 
that is, just the sum of the individual masses, independent of both the mechanical masses and the 
interparticle separation ryi- The absence of an interaction term can be traced to the cancellation of 
the Newtonian attraction of the masses with the Coulomb repulsion of the like charges. 21 This also 
checks with the result expected at large separation for particles whose mass and charge are related 
by mi t 2 = (e 2 1,2/4^7) : / 2 . For opposite charges, such a cancellation should not occur. However, we 
have not been able to obtain a rigorous solution for a pure particle state of two opposite charges. 

All the usual Newtonian theory results may be obtained from (7.5) or from the corresponding 
equation [(3.8) of V] for two bodies in the appropriate limit. This "dilute" limit consists in regarding 

gravitational self-energy \ p4> as part of the source, i.e., V 2 = \p = \{po + \p<t>)- Eliminating p, one obtains 
V 2 <f> — \ po(l — \ (j))^ 1 . In terms of \ — 1 ~ \ 4>i this is just (7.2a) in the neutral case. It is interesting that this 
argument yields the rigorous field equations in the frame (4.22) for this situation. For the point electric charge, the 
same argument with po replaced by po + \ p e <t>e (4>e being the electrostatic potential) leads to an equation for (f> which, 
while different from (7.2a), does yield the correct total energy. 

20 It is interesting to note that, even though m — for the neutral particle, this does not imply that space is 
everywhere flat. The metric is indeed flat for r > e, but rises steeply in the interior. For example, J °° d 3 r^/^g 3 R = mo, 
which shows that space is curved at the origin. 

21 This cancellation implies that the solution should be static (since there is no initial potential between the parti- 
cles). This has indeed been shown to be the case for mi, 2 = 2)61,2! by Papapetrou (1947). 
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mo and e small compared to e before passing to e = 0, thus preventing the strong (non-Newtonian) 
gravitational interactions. Alternately, the "dilute" limit represents a perturbation expansion of 
the theory in powers of the coupling constant 7. From (7.1) one easily sees that 

E ~ too + (e 2 /4vr - 7m 2 )/2e + 0(l/e 2 ) . (7.6) 

[For two bodies, the Newtonian and Coulomb interaction terms (eie2/47r — 7(mo)i(mo)2)A'i2 also 
appear.] Since mo and e are unrelated, there is no cancellation in the self-energy term of (7.6). In 
fact, the perturbation expansion consists of an infinite series of more and more divergent terms. 
The inapplicability of such an expansion is shown by the finite rigorous answer to = 2\e\. If one 
attempted to use standard perturbation renormalization techniques, one would indeed find that 
the theory was unrenormalizable (see V). 

7.3. Determination of full metric due to a charge. Stability of the particle. Vanishing of 
self-stress. We have been able to find the self energy purely from the initial value (constraint) 
equations. The full solution of the problem consists also in the specification of the A, Aj(<7o^) for 
our frame as well as the time development of the system (fuller details are in VA). The equations 
determining A and A, are obtained by taking the time derivatives of the coordinate conditions 
(4.22c,d). These are basically linear combinations of the right-hand sides of (3.15a,b) [including, of 
course, the coupling term from (6.7)]. Fortunately, the initial values and the coordinate conditions 
for our problem reduce these equations to quite simple form. At the initial time, (3.15a) becomes 

doga = N i{j + Nj\i . (7.7) 

Taking the time derivative of (4.22c), one obtains a homogeneous equation for A, which has the 
solution 

A i (r,0) = 0. (7.8) 

The relevant part of (3.15b) is now just the dotr 11 equation. The coordinate condition (4.22d), the 
isotropic form of gij, and the constraint equation (7.2) simplify this equation to 

d m ( X 2 d m N) = \ N{ X - 2 £ iL £ iL + m <5 3 (r)} . (7.9) 

For the point particle, the solution for A is just (see VA): 

N = (1 + \e\/S^r)- 1 = x 2 ■ (7.10) 

Note that the total solution for g^, is everywhere non-singular (except, of course, at the particle). 
In VA, it was shown that this coordinate system led to a completely non-singular metric g^ v initially, 
for a spread-out distribution of charge and mass with arbitrary mo and e. (This is in contrast to the 
standard Reissner-Nordstrom metric in isotropic coordinates for which A always has a singularity 
for small enough e.) 

Further, the point charge is a stable object, since the equations (7.8) and (7.10) for g^ u initially, 
lead to a static solution (dtg^u = = 0). The cancellation of the (repulsive) electrostatic 

self- forces by the gravitational ones is made clear by examining the components of X y ', the total 
system's spatial stress density. The conservation requirement on the total stress tensor implies 
that V\j = -1°\ = for the static case, where X*-? are the total system's spatial stresses. Thus, 
in the notation of the orthogonal decomposition, X lJI reduces to X*-? = % i i TT + since it is 

transverse. Spherical symmetry means that < Z t ^ TT , vanishes, since no preferred transverse direction 
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can be distinguished. Hence, % lJ has at most one independent component which may be taken as 
T M . For our static point solutions T M (as calculated from either the Landau-Lifshitz or Einstein 
pseudotensors) vanishes everywhere for arbitrary tjiq and e. In the rest frame, then, ST 1 " = p 8q <5g 
where f d 3 r p = m, the clothed mass. In a moving system, therefore, 1^ = p(dx^ / dr){dx u / dr) (r 
is Lorentz proper time). The vanishing of T lJ in the rest frame, then, is necessary for the structure 
of the total stress tensor to be that of a renormalized mass m. (This requirement is stronger than 
the usual one that = J d 3 rT° ^ transform like the energy-momentum of a particle. 22 ) Thus the 
point charge is a completely stable object, without any ad hoc pressure (cohesive) terms required, 
and its mass is completely determined by its field interactions. 

7.4. Comparison with standard Reissner- Nordstrom solutions. It is interesting to compare our 
results with the standard discussion of Schwarzschild (neutral) and Riessner-Nordstrom (charged) 
solutions. In these approaches, no bare matter parameters were ever introduced, so that the self- 
energy problem could not be formulated. The source term used in the standard treatment is the 
Maxwell stress tensor, together with a matter source term of a perfect fluid with four- velocity C/ M : 



The scalar "proper rest-mass density" poo is, as can be seen from this equation, the mass density 
in the locally inertial rest-frame, while po is the pressure in this frame (see, for example, M0ller, 
1952). In our treatment, on the other hand, the matter was introduced dynamically rather than as 
an externally prescribed source. This was accomplished by including a Lagrangian for the particle. 
Comparing (7.11) with the source terms in our equations, 23 one finds that poo = m oX & <5 3 ( r )> 
which shows that pqq includes "clothing" effects through the x ~ 6 -factor. The pressure terms in the 
usual source tensor (7.11) are introduced expressly in order to make the distribution stable. Since 
a pressure term summarizes the presence of other forces (not being treated dynamically), these 
forces would contribute to the clothing of the original mechanical mass and hence the parameter 
mo would now represent this original mechanical mass plus the clothing due to the forces giving rise 
to the pressure term. In our analysis, where no phenomenological pressure terms are invoked, one 
cannot at the outset ask that a solution be static. In fact, extended initial distributions of matter 
and charge are not, in general, stable, although the point limit does have this property. 

8. Outlook 

8.1. Discussion of quantization. In this survey, we have seen that general relativity may be 
viewed as an ordinary classical field, once the meaning of the coordinate invariance has been ana- 
lyzed. It was then possible to treat the dynamics of the gravitational field according to techniques 
common to classical field theory. In this way, many physical properties could be directly understood 
from their counterparts in, say, electrodynamics. For example, the canonical formalism provided a 
unique definition of energy and gravitational radiation. The gravitational field, of course, also has 
aspects not found elsewhere. In particular, its sources are the total energies of all other fields. The 
attractive, static interaction, part of the total energy provided the possibility of a compensating 
effect on the flat-space self-energies of other fields. Thus, a stable classical point electron of finite 

22 The standard static Reissner-Nordstrom solution has 1" / but J d 3 r1" — 0. Thus it obeys the weaker condition 
on P M but its total stress tensor is not that of a particle. This is due to the presence of the phenomenological pressure 
terms which were needed there to stabilize the particle. 

23 In VA, the comparison with (7.11) was carried out in terms of a "dust" model of the particle, that is, with a 
Lagrangian describing the dynamics of a continuous distribution of matter in gravitational interaction, and without 
phenomenological pressure forces. 
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mass exists when gravitation is included. 

A realistic elementary particle theory must, of course, be formulated in quantum terms to 
determine whether the quantum self-mass is also finite. A full quantum treatment includes the 
quantization of the dynamical modes of the gravitational field, and not only of the effects arising 
due to a g-number source. Since a complete set of Poisson bracket relations among the classical 
canonical variables has been obtained, a correspondence principle quantization may be performed 
immediately by transcribing the P.B.'s into commutators. Such a quantization may indeed be 
valid when appropriate care of the ordering of the canonical operators in the non-linear parts of 
the Hamiltonian is taken. However, certain consistency criteria proper to quantum theory and 
extending beyond this possible ambiguity must be examined. The reduced system (involving only 
the canonical formalism and its two degrees of freedom) does not represent the full statement of 
general relativity: one also has equations to determine go^, for example, which are not part of 
this canonical theory as such. Furthermore, one encounters these variables as soon as one makes a 
Lorentz transformation from the initial frame, 24 an operation that must be allowed for any sensible 
quantum theory. Since the equations defining go^ are now quantum ones, one must establish the 
consistency of the ordering of the Hamiltonian and the go^ equations in one Lorentz frame with 
those in another frame. Finally, we saw classically that there was an infinite number of a priori 
equivalent sets of simple canonical variables with conserved Hamiltonians, each of which could, 
of course, be used as a basis for such a quantum scheme; quantum mechanically, however, the 
relation between these sets of variables need no longer be one of a unitary transformation due 
to the operator character of the variables. Hence these are no longer a priori equivalent starting 
points for quantization. The classical canonical transformations among these sets will cause the 
coordinates associated with one set to be functions of both the canonical variables and coordinates 
of the other. This would lead to the phenomenon, in the quantum theory, of the coordinates of one 
set being q- numbers when expressed in terms of the variables of the other set, 25 as was discussed 
in II and IVA. 

In view of the many ambiguities which could arise in an attempt to quantize consistently at 
this level, it would seem more fruitful to return to the Lagrangian in four-dimensional form, i.e., 
Q^ u R^v\F a \p\ and try to repeat our reduction to canonical form within the framework of quantum 
theory. There, one can use the manifest Lorentz covariance of the original Lagrangian as an aid 
in proving the Lorentz covariance of the canonical quantum form that should arise. Further, the 
ordering ambiguities are drastically reduced: since at most cubic terms enter in this Lagrangian, 
one can show easily that there is a simple three-parameter family of available Hermitian quantum 
Lagrangians, all of which are generally covariant. These different orderings of the Lagrangian differ 
from each other only by double commutators, i.e., by effects of order h 2 . The basic requirement of 
consistency between the Lagrange and Heisenberg equations of motion should single out one of these 
forms, since commutators of Bose fields do not contribute to Lagrange equations but presumably 
affect the Heisenberg ones. In an investigation based on the four-dimensional form of the quantum 
Lagrangian, one ultimately expects to arrive at canonical forms very similar to those obtained here. 

24 In electrodynamics one similarly has that the gauge functions (i.e., the scalar potential and longitudinal part 
of the vector potential) mix with the dynamical variables (i.e., the transverse parts of the vector potential) when a 
Lorentz transformation is made. 

25 A similar situation could arise in particle mechanics with a transformation i = t + f(p, q). Here one would not so 
transform, since a preferred time coordinate t has been decided upon; in gravitation, however, no such single preferred 
coordinate frame exists in the interior, at least classically. Whether the quantum theory may force a preferred frame 
through the consistency requirements is not known. 
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Thus the classical results should represent an excellent guide in formulating the quantum theory. 6 

8.2. Speculations on quantum self-energy problem. Leaving aside these technical questions 
on the rigor of gravitational quantization, however, one may speculate as to the quantum effects 
on the self-energy problem. The numerical value obtained classically for the mass m of a point 
particle with electronic charge, m = e/i&ir^) 1 / 2 ~ 10 18 m e , is much too large. Of course, one would 
not expect classical theory to give correct numerical values for masses. Any realistic discussion 
of self-masses must be based on quantum theory. However, if the effective flat-space cutoff (a ~ 
[e/(47r) 1//2 ]7 1 / 2 ~ 10~ 34 cm) obtained in this paper were to hold also in quantum theory, 27 the 
numerical values for the mass and charge would be quite different. For example, using such a 
cutoff in Landau's estimate (Landau, 1956) for the renormalized charge, one finds e r 2 /47r w 1CT 2 
(essentially independent of the bare charge) as noticed by Landau. Thus, Landau obtains 

e 2 = e 2 [l + {2/3TT)iy(e 2 /4TT)ln{(h/mc)/a}}- 1 

« [{y / '12tt 2 ) In {(h/mc)/ a}}' 1 (8.1) 

where v « 10 is essentially the number of charged fields. Equation (8.1) was obtained by summing 
the dominant terms in each self-energy diagram. With an effective cutoff of physical origin, the 
usual objections to making an estimate of what otherwise is a divergent series need no longer hold. 
The methods of Landau also yield a formula for the renormalized mass in terms of the bare mass 
and the charge. 28 In the classical theory, we saw that the bare mass did not enter. To what extent 
this is maintained in the quantum theory (e.g., to what extent the bare mass distribution remains 
effectively a (5-function) is not clear. It should also be emphasized that we have assumed in the 
above discussion the simplest possibility, that the gravitational effects on the quantum theory can 
be summarized in terms of a cutoff of the type considered by Landau. 

Finally, the treatment given here has not touched on several major topics of interest in the 
classical theory. For example, the problem of motion (Einstein-Infeld-Hoffman analysis), has not 
been examined from the present viewpoint. Here, it is hoped that the canonical methods will shed 
further light on the manner in which the coupling of particles with the various components of the 
metric determines their motion. A particular issue where these techniques might be relevant is 
the question of radiation by the moving masses. Some further topics involve spaces with non-flat 
boundary conditions, which arise in cosmology, as well as spaces where the topology is not Euclidian. 
It remains to be seen whether an approach of this type can be applied to such situations. 



26 It has been shown in I that, if the theory can be quantized at all, it obeys Bose statistics (as intuitively expected). 

27 An effective quantum gravitational cutoff might, on dimensional grounds, be ~ ^ftc -3 ) 1 / 2 ~ f0~ 33 cm. This 
differs from the classical a merely by the factor a 1 / 2 = (e 2 /Antic) 1 / 2 and would not affect the discussion in the text. 

28 This formula, m = mo(e 2 /e r 2 ) 9 ' 4 ", does not make clear what relation between m, mo, and e r might be expected, 
since the estimate of (8.1) fails to determine e with sufficient accuracy. 
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